Maximum likelihood estimation of locus-specific mutation rates in Y-chromosome short tandem repeats
نویسندگان
چکیده
MOTIVATION Y-chromosome short tandem repeats (Y-STRs) are widely used for population studies, forensic purposes and, potentially, the study of disease, therefore knowledge of their mutation rate is valuable. Here we show a novel method for estimation of site-specific Y-STR mutation rates from partial phylogenetic information, via the maximum likelihood framework. RESULTS Given Y-STR data classified into haplogroups, we de-scribe the likelihood of observed data, and develop optimization strategies for deriving maximum likelihood estimates of mutation rates. We apply our method to Y-STR data from two recent papers. We show that our estimates are comparable, often more accurate than those obtained in familial studies, although our data sample is much smaller, and was not collected specifically for our study. Furthermore, we obtain mutation rate estimates for DYS388, DYS426, DYS457, three STRs for which there were no mutation rate measures until now.
منابع مشابه
Maximum-likelihood estimation of site-specific mutation rates in human mitochondrial DNA from partial phylogenetic classification.
The mitochondrial DNA hypervariable segment I (HVS-I) is widely used in studies of human evolutionary genetics, and therefore accurate estimates of mutation rates among nucleotide sites in this region are essential. We have developed a novel maximum-likelihood methodology for estimating site-specific mutation rates from partial phylogenetic information, such as haplogroup association. The resul...
متن کاملTowards Improvements in the Estimation of the Coalescent: Implications for the Most Effective Use of Y Chromosome Short Tandem Repeat Mutation Rates
Over the past two decades, many short tandem repeat (STR) microsatellite loci on the human Y chromosome have been identified together with mutation rate estimates for the individual loci. These have been used to estimate the coalescent age, or the time to the most recent common ancestor (TMRCA) expressed in generations, in conjunction with the average square difference measure (ASD), an unbiase...
متن کاملDinucleotide repeats in the Drosophila and human genomes have complex, length-dependent mutation processes.
We use methods of maximum likelihood estimation to fit several microsatellite mutation models to the observed length distribution of dinucletoide repeats in the Drosophila and human genomes. All simple models are rejected by this procedure. Two new models, one with quadratic and another with piecewise linear slippage rates, have the best fits and agree with recent experimental studies by predic...
متن کاملY-chromosome-specific microsatellite mutation rates re-examined using a minisatellite, MSY1.
Polymorphic Y-chromosome-specific microsatellites are becoming increasingly used in evolutionary and forensic studies and, in particular, in dating the origins of Y-chromosomal lineages. Previously, haplotyping of Y chromosomes from males belonging to a set of deep-rooting pedigrees was used to estimate a conservative average Y-chromosomal microsatellite mutation rate of 2.1 x 10(-3)per locus p...
متن کاملChromosome-wide characterization of Y-STR mutation rates using ultra-deep genealogies
Although the utility of short tandem repeats on the Y-chromosome (Y-STRs) has long been recognized and leveraged in forensics, genealogy and paternity testing, the bulk of these applications have relied on only a few dozen loci identified as having remarkably high mutation rates. Recent efforts have expanded the set of Y-STRs with known mutation rates to two hundred markers, but the limited thr...
متن کامل